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Abstract — A generic (r, m)-erasure correcting set is a col- 
lection of vectors in F2 wliich can be used to generate, for 
each binary linear code of codimension r, a collection of parity 
check equations that enables iterative decoding of all correctable 
erasure patterns of size at most m. That is to say, the only 
stopping sets of size at most m for the generated parity check 
equations are the erasure patterns for which there is more than 
one manner to fill in the erasures to obtain a codeword. 

We give an explicit construction of generic (r, m) -erasure 
correcting sets of cardinality l]™g^ ( 7 )• Using a random- 
coding-like argument, we show that for fixed m, the minimum 
size of a generic (r, m)-erasure correcting set is linear in r. 

Keywords: iterative decoding, binary erasure channel, stopping 
set 

L Introduction 

This paper is motivated by the following well-known 
scheme for iterative decoding of a binary linear code C used 
on the binary erasure channel [1]. We are given a set H of 
parity check equations for C. For a received word with E as 
set of erased positions, we inspect if one of the parity check 
equations from H involves exactly one of the erasures. If so, 
we determine the value of the erasure involved in this equation 
and continue; if not, we stop the algorithm. In the latter case, 
the set E is called a stopping set for 7i [1]. Different sets H of 
parity check equations for C may result in different stopping 
sets. Note, however, that the support of a codeword is always 
a stopping set, as by definition each parity check vector has 
an even number of ones within the support of a codeword. 

We are interested in the behavior of the iterative decoding 
algorithm for erasure patterns that are C -correctable, i.e., for 
which there is only one way to fill in the erasures to obtain a 
word from C. In fact, we wish to find parity check collections 
with which the iterative decoding algorithm decodes all C- 
correctable patterns of a sufficiently small cardinality. As 
related work, we mention that in [2], Weber and Abdel- 
Ghaffar construct collections of parity check equations for the 
Hamming code Cr of redundancy r with which the iterative 
algorithm decodes all -correctable erasure patterns of size at 
most 3. In [3] and [4], Schwartz and Vardy study the minimum 
size of collections of parity check equations for a code C for 
which the iterative algorithm decodes all erasure patterns of 
size less than the minimum distance of C (note that all such 
erasure patterns are C-correctable). 



In [5], we introduced and constructed so-called generic 
(r,m)-erasure correcting sets. These are subsets A of F2 such 
that for any code C of length n and codimension r, and any 
r X n parity check matrix H for this code, the collection of 
parity check equations 

{slH I aeA} 

allows the iterative decoding algorithm to correct all C- 
correctable erasure patterns of size at most m. Our aim is to 
construct generic (r, m)-erasure correcting sets of small size, 
and to investigate the minimum size F{r, m) of such sets. At 
first sight, the definition of generic (r, m)-erasure correcting 
sets seems to be very restrictive. However, in [5] it was shown 
that if a set of linear combinations works for a parity check 
matrix Hr of the Hamming code of redundancy r, then it 
works for any parity matrix for any code of redundancy r - 
see Proposition 2.6 of the present paper for a more precise 
formulation and a proof. 

The paper is organized as follows. In Section |ll| we in- 
troduce notations and definitions. In Section |llll we present 
the explicit generic (r, TO)-erasure correcting sets from [5]. 
These sets have size X^illo^ C^7^)- Section Hvl we show 
that F{r, m) > r. With a random-coding like argument we 
show that for each m > 1, there exist a constant such that 
F{r, m) < Cmf for each r > to. At present we do not have a 
constructive proof that F{r, to) is linear in r. ' 

II. Preliminaries 

In this section, we introduce some notations and definitions. 
Throughout this paper, we use boldface letters to denote row 
vectors. All vectors and matrices are binary. If there is no 
confusion about the length of vectors, we denote with the 
vector consisting of only zeroes, and with the «-th unit 
vector, the vector that has a one in position i and zeroes 
elsewhere. 

The size of a set A is denoted by | A|. If is a r x n matrix 
and EC- {1, 2, ... , n}, then the restriction H{E) of H to E 
denotes the r x matrix consisting of those columns of H 
indexed by E. Similarly, if x e and S C {1,2, . . . 
then the restriction x{E) of x to iJ is the vector of length \E\ 
consisting of the entries indexed by E. 

'In [6], we gave an explicit recursive construction, too involved to be 
included here, of generic (r, 3)-erasure correcting sets of size 



The support supp(x) of a vector x G is the set of its 
non-zero coordinates, that is, 

supp(x) = {i G {1, 2, . . . , n} I Xi 7^ 0}, 

and the weight wt(x) of x is the size |supp(x)| of its support. 

As usual, an [n, k] code C is a fc-dimensional subspace of 
the dual code of C, denoted by C^, is the [n,r] code 
with r — n — k consisting of all vectors in that have inner 
product with all words from C. The number r is referred to 
as the codimension or redundancy of the code. An rxn matrix 
is called a parity check matrix for C if its rows span C^. 
When we speak about "code", we will always mean binary 
hnear code. 

The following definitions are taken from [5]. 

Definition 2.1: Let C C F2 be a code. A set _B C 
{l,2,...,n} is called C -uncorrectable if it contains the 
support of a non-zero codeword, and C -correctable otherwise. 

The motivation for this definition is that a received word 
containing only correct symbols and erasures can be decoded 
unambiguously precisely when exactly one codeword agrees 
with this word in the non-erased positions; for linear codes 
this is the case precisely when the set of erasures does not 
contain the support of a non-zero codeword. 

Definition 2.2: Let C C F^' be a code. A set W C is 
called va-erasure reducing for C if for each erasure pattern 
E C {l,2....,n} of size m that is C-correctable, there 
exists a parity check equation hGH with exactly one 1 in 
the positions indexed by E, that is, with wt(h(i?))=l. The 
set Ti is called m-erasure decoding for C if it is m'-erasure 
reducing for C for all m' with l< m' < m. 

Definition 12.21 implies the following. If the iterative decoding 
algorithm is used with a set H of parity check equations 
that is TO-erasure reducing for C, then for each C-correctable 
erasure pattern of size m at least one erasure is resolved; if 
H is TO-erasure correcting for C, then the iterative decoding 
algorithm can correct all C-correctable erasure patterns of size 
at most TO by removing one erasure at the time, without ever 
getting stuck. Note that this definition makes no requirements 
on the behaviour of the iterative decoding algorithm for erasure 
patterns that are not C-correctable. 

The following example shows that an TO-erasure reducing 
set for a code C need not be an TO-erasure correcting set for 
C. 

Example 2.3: Let C be the binary [5,1,5] repetition code, 
and let Ti. consist of the four vectors hi = 10001, h2 = 01100, 
hg = 01111, and h4 = 01010. Note that Ti spans the dual 
code C"*- of C (which is just the even- weight code of length 
five). In the table below, we provide for each set of erasures 
of size four a parity check equation that has weight one inside 



this erasure set. 

non-erased position parity check equation 

1 hi 

2 h2 

3 h2 

4 hi 

5 hi 

The set Ti. is therefore 4-erasure reducing for C. It is, 
however, not 4-erasure correcting for C, as {2,3,4} is a 
stopping set that does not contain the support of a nonzero 
codeword. So for example the erasure set {1,2,3,4} is C- 
correctable, and can be reduced but not corrected by H. 

Finally, in [5] we introduced the notion of a "generic" TO- 
erasure correcting and reducing set for codes of a fixed codi- 
mension. The idea is to describe which linear combinations to 
take given any parity check matrix for any such code. 

Definition 2.4: Let 1 < to < r. A set C Fj is called 
generic (r, m)-erasure reducing if for any n > r and for any 
rxn matrix H of rank r, the collection {aH \ a G .4} is 
TO-erasure reducing for the code with parity check matrix H. 
The set ^ C Fj is called generic (r, m)-erasure correcting 
if it is generic (r, to') -erasure reducing for all to' with 1 < 

to' < TO. 

The following useful characterization of generic (r, TO)-erasure 
reducing sets has been obtained in [5]. 

Proposition 2.5: A set ^ C F2 is generic (r, TO)-erasure 
reducing if and only if for any r x m matrix M of rank to 
there is a vector aG^ such that wt(aAf ) = 1. 

The proof of this proposition can be outlined as follows. It can 
be shown that an erasure pattern E is C-correctable if and only 
if for any parity check matrix H for C, the restriction H{E) 
has full rank. Hence, we need only consider rxm submatrices 
of full rank, and each rxm matrix of full rank can occur as 
such a submatrix. Using Propostion 12.51 it can be shown that 
for all codes of a fixed codimension, the Hamming code is 
the most difficult code to design generic erasure reducing sets 
for The following proposition states this fact more precisely. 

Proposition 2.6: Let m < v. Let H,. be a parity check 
matrix of the [2'' — 1,2'' — r — 1,3] Hamming code Cr- A 
set ^ C F2 is generic (r, TO)-erasure reducing if and only if 
Ti — {aHr I a G A} is TO-erasure reducing for C,-. 

Proof: Combination of Proposition 12.51 and the fact that 
any rxm matrix of rank to occurs, up to a column permu- 
tation, as a submatrix of i/,.. ■ 

We are interested in generic (r, to) -erasure reducing sets of 
small size. This motivates the following definition. 

Definition 2.7: For 1 < to < r, we define F{r,m) as the 
smallest size of any generic (r, TO)-erasure reducing set. 

Note that Proposition 12.51 implies that Fj \ {0} is generic 
(r, TO)-erasure reducing, so E{r, to) is well-defined. 

Example 12.31 shows that for particular codes C, the notions 



"m-erasure reducing for C" and "m-erasure correcting for C" 
need not be the same; as we proceed to show, the notions 
"generic (r, m)-erasure reducing" and "generic (r, m)-erasure 
correcting", are, somewhat surprisingly, equivalent. 

Proposition 2.8: Let 2 < m < r. A generic (r, m)-erasure 
reducing set is a generic (r, m — 1) -erasure reducing set. 

Proof: Let ^ be a generic (r, to) -erasure-reducing set. 
Let M be a binary r x (m — 1) matrix of rank to — 1. We 
write 

M = [Mo I x^] , 

where denotes the rightmost column of M. Let be a 
vector in that is not in the linear span of the columns of 
M, and let M' denote the r x m matrix defined as 

Af'=[A/o|y^|xT+yT]. 

As M' has rank to, there exists a vector aG^ such that 
wt(aAf') = 1. We claim that wt(aAf) = 1. This is clear 
if wt(aAfo) = 1, as then ax^ = ay^ = 0. If elMq = 0, then 
ay^ = and a(x^ + y^) = 1, or vice versa. In either case, 
ax^ = ay^ +a(x^ +y^) = 1, from which we conclude that 
in this case also aM has weight I. ■ 

Note that Proposition 12.81 implies that the parity check equa- 
tions induced by a generic (r, TO)-erasure reducing set can also 
be used to resolve an erasure from a correctable erasure set 
of size TO — 1, TO — 2, ... (as shown in Example 12.31 this need 
not hold for a specific TO-erasure reducing set for a specific 
code). In other words, the following proposition holds. 

Proposition 2.9: Any generic (r, TO)-erasure reducing set is 
a generic (r, TO)-erasure correcting set. 

According to Proposition 12.91 the terms "generic (r, to)- 
erasure reducing" and "generic (r, to) -erasure correcting" can 
be used interchangably. In the sequel, we use "correcting", 
and base our results on the characterization given in Proposi- 
tion |531 

III. Explicit GENERIC (r,TO)-ERASURE correcting 

SETS 

In this section, we describe generic (r, TO)-erasure correct- 
ing sets Ar,m for all r and to with r > m > 2 (see also 
[5]). We will show that the sets Ar,3 are closely related to the 
sets found by Weber and Abdel-Ghaffar [2]. Modifications and 
generalizations of these sets can be found in [5] and [6]. 

Theorem 3.1: Let 2 < m < r. The set Ar,m defined as 

Ar,m = {a = (ai, 02, . . . , ar) e F2 I ai = 1 and wt(a) < to} 

is a generic (r, TO)-erasure correcting set of size 




Proof: As Ar,7n consists of all vectors that start with a 
one and have weight at most to — 1 in the positions 2,3,. . . ,r, 
the statement on the size of Ar.m is obvious. 



In order to show that Ar.m is indeed generic (r, TO)-erasure 
correcting, we will use Proposition l2.5l So let M be an r x to 
matrix of rank to. We have to show that there is a vector 
aiiAr,m such that wt(aAf)=L To this end, we proceed as 
follows. For 1 < i < r, let irii denote the i-th row of M. 
Let / C {1, 2, . . . , r} be such that {rrii | i G /} forms a basis 
for F™. We distinguish two cases. 

(i) : mi ^ 0. 

We can and do choose / such that le /. The set {J^iei ^i^i I 
{xi)i(zi,xi = 0} is an (to — 1) -dimensional space and hence 
cannot contain all unit vectors. So there exists a vector x = 
(xi)ig/ with xi — 1 and wt{J2i^i XiUni) = 1. Now, let aGF2 
be the vector that agrees with x in the positions indexed 
by / and has zeroes elsewhere. Then ai = xi = 1 and 
wt(a)=wt(x) < TO, hence a^Ar,m and aM = J2l=i ^4^^ = 
Y^i^i Xiiiii, so wt(aA/) = 1. 

(ii) : mi = 0. 

In this case 1 ^ /. As {m.i | i G /} forms a basis, there 
are independent vectors x(j) = i^iij) I * G -^) such that 
Bj — J2iei foJ" j- there is just one vector x 

of weight TO, and there are to > 2 unit vectors, there is an 
index j such that wt(x(j)) < to — 1. Now, let a be the vector 
that agrees with x(j) in the positions indexed by /, has a 
"1" in the first position, and zeroes elsewhere. As wt(x(j)) < 
TO — 1, the vector a is in Ar,m- Moreover, we have that aAf = 
J2i=i O'i^i = + J2i<^i airxii = + ej = e^. ■ 

We now relate our result for to = 3 to that of Weber and 
Abdel-Ghaffar [2], which in our terminology states that 

Wr = {ei\l <i<r}U{ei+ei + ej \ 2 <i<j < r} 

is generic (r, 3)-erasure correcting. To this end, let S be the 
matrix with the all-one vector as leftmost column, and with 
ej as j-th column for 2 < j < r. Clearly, S is invertible, and 
for 2< i < r and 2 < j < k < r, we have that 

ei5 = ei, (ei + ei)S = e^, and 

(ei + Bj + ek)S = ei + Gi + 

As a consequence, we have that 

Wr = {aS I a G Ar,3}- 

So Wr and Ar,3 are related via an element-wise multiplication 
with the invertible matrix S. 

IV. Upper and lower bounds on F{r, to) 

In this section, we show that F{r, to) is of linear order in 
r. To be more precise, we will show that for each to > 1, 
there exists a constant such that for each r > to, we have 
that r < F{r, to) < c,„r. 

Concerning the lower bound, we have the following lemma. 

Lemma 4.1: Any (r, TO)-erasure decoding set spans Fj. As 
a consequence, F(r, to) > r. 

Proof: (cf. [5]) Suppose ^ C Fj is such that span(^) ^ 
Fj. We will show that A is not generic (r, TO)-erasure decoding 
by constructing an r x to matrix M with rank to such that 



for each aG^, the vector aM does not have weight 1 (cf. 
Propostion l2.5> . 

Let V be a non-zero vector that has inner product with all 
words from A. Let AI be an invertible matrix such that the 
i-th row of M has odd weight if and only if i e supp(v), 
and let a.^A. As (v, a) = 0, the vector ai\/ is the sum of an 
even number of (odd weight) rows of M indexed by integers 
from supp(v), and some (even weight) rows of AI indexed 
by integers outside supp(v). As a consequence, aM has even 
weight. ■ 

The proof for the upper bound (cf. [6]) can be considered to be 
a random-coding argument: we will show that the collection 
of all subsets of F2 of a sufficiently large size contains at least 
one generic (r, TO)-erasure correcting set. The precise result is 
as follows. 

Theorem 4.2: For all m > 1 and r > m, we have that 



F{r, m) < 



m 



-log2(l - rn2-™) 

where log2 denotes the base-2 logarithm. 

Proof: Let 1 < m < r. We write Aim.r to denote the 
collection of all binary r x m matrices of rank m. Let N 
be some positive integer. Consider the following experiment. 
We randomly construct a binary N x r matrix A by setting 
each individual entry to zero or to one, each with probability 
1/2. We interprete this matrix as a sequence of N row vectors 
ai, . . . ,SiN, each of length r. For each matrix Ad in Aim,r, 
we define the random variable Xm by 

0, if there is an i such that wt{aiM) ~ 1; 

1, otherwise. 



— 



So Xm = if the matrix AI is "good" with respect to the 
vectors ai, . . . , aAr, and Xm = 1 if AI is "bad". 
Furthermore, let the random variable X be defined as 



X — ^M- 

MeM„^,r 

The random variable X thus counts the number of bad 
matrices with respect to A; \f X < 1, then all matrices 
are "good" with respect to A, so that the collection A = 
{ai, . . . ,aAr} C F2 satisfies the criterion in Proposition 12.51 
Consequently, if E[X] < 1, then all matrices in Mm,r are 
good with respect to some matrix A, and so F{r, m) < N. 

In order to compute E[X], we start by fixing a matrix M G 
Mm^r and compute the probability Prob(XM = 1) that Xm 
is equal to 1. As M has full rank, there are, for each i = 
1,2, ... ,m, exactly 2''"™ vectors a such that aM — e^. We 
conclude that there are m2^~™' "good" vectors for AI, and 
hence 2™(1 — m2~'") "bad" vectors. Now the matrix AI is 
bad if all the vectors ai, . . . , ajv are bad; we conclude that 



Prob(XM = 1) = (1 - m2 



-rn\N 



Since expectation is a linear operation, we have that 

E[X] = ^[^Af] = l-^^.-Kl - m2-'")^, 

from which we conclude that E[X] < 1 if and only if 

-log2(l " to2-™) 

As a consequence of the foregoing, we have that F{r, m) < 
N if N satisfies Q; as |A^m,r| is at most 2""^, the cardinality 
of the set of all r x m matrices, it follows that F{m, r) < N 
whenever 

m 

N > 



-l0g2(l - 7712-™) 



V. Conclusions 

In this paper, we have introduced the notion of generic 
(r, m) -erasure correcting sets in WTy, such sets provide for each 
binary code C with redundancy r a collection of parity check 
equations for C that can be used to iteratively correct all C- 
correctable erasure patterns of size at most m. We provided an 
explicit construction of generic (r, m) -erasure correcting sets 



of size J2Zo 



m-l /r-l^ 



, generalizing the result for m = 3 from 



[2]. We also showed, by a random-coding-like argument, that 
for each fixed m, the minimal size of generic (r, m) -erasure 
correcting sets is linear in r. 

The main remaining problem is to find explicit constructions 
for (r, TO)-erasure correcting sets of size linear in r, especially 
in the first open case m ~ 3. In [6], we provide an explicit 
recursive construction of (r, 3)-erasure correcting sets of cardi- 
nality l + 3(r — 3 _ jjQj linear in r, but much smaller than 
the (r, 3) -erasure correcting sets from Section III for which the 
cardinality is quadratic in r. 
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